Distributional reinforcement learning with unconstrained monotonic neural networks
نویسندگان
چکیده
The distributional reinforcement learning (RL) approach advocates for representing the complete probability distribution of random return instead only modelling its expectation. A RL algorithm may be characterised by two main components, namely representation together with parameterisation and metric defining loss. present research work considers unconstrained monotonic neural network (UMNN) architecture, a universal approximator continuous functions which is particularly well suited different representations distribution. This property enables efficient decoupling effect function class from that metric. paper firstly introduces methodology (PDF, CDF QF). Secondly, novel named deep Q-network (UMDQN) presented. To authors' knowledge, it first method supporting three, valid Lastly, in light this new algorithm, an empirical comparison performed between three quasi-metrics, Kullback-Leibler divergence, Cramer distance, Wasserstein distance. results highlight strengths weaknesses associated each important limitation
منابع مشابه
Reinforcement Learning in Neural Networks: A Survey
In recent years, researches on reinforcement learning (RL) have focused on bridging the gap between adaptive optimal control and bio-inspired learning techniques. Neural network reinforcement learning (NNRL) is among the most popular algorithms in the RL framework. The advantage of using neural networks enables the RL to search for optimal policies more efficiently in several real-life applicat...
متن کاملReinforcement Learning in Neural Networks: A Survey
In recent years, researches on reinforcement learning (RL) have focused on bridging the gap between adaptive optimal control and bio-inspired learning techniques. Neural network reinforcement learning (NNRL) is among the most popular algorithms in the RL framework. The advantage of using neural networks enables the RL to search for optimal policies more efficiently in several real-life applicat...
متن کاملreinforcement learning in neural networks: a survey
in recent years, researches on reinforcement learning (rl) have focused on bridging the gap between adaptive optimal control and bio-inspired learning techniques. neural network reinforcement learning (nnrl) is among the most popular algorithms in the rl framework. the advantage of using neural networks enables the rl to search for optimal policies more efficiently in several real-life applicat...
متن کاملPlanning with neural networks and reinforcement learning
planning with neural networks, time limits of discounted reinforcement learning Planning, taskability, Dyna-PI architectures Dyna-PI architectures: focussing, forward and backward planning, acting and (re)planning. Tested with... Ideas from problem solving and
متن کاملStable reinforcement learning with recurrent neural networks
In this paper, we present a technique for ensuring the stability of a large class of adaptively controlled systems. We combine IQC models of both the controlled system and the controller with a method of filtering control parameter updates to ensure stable behavior of the controlled system under adaptation of the controller. We present a specific application to a system that uses recurrent neur...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Neurocomputing
سال: 2023
ISSN: ['0925-2312', '1872-8286']
DOI: https://doi.org/10.1016/j.neucom.2023.02.049